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Abstract 



The Canada Day Theorem is an identity involving sums of k x k minors 
of an arbitrary n x n symmetric matrix. It was discovered as a by-product 
of the work on so-called peakon solutions of an integrable nonlinear partial 
differential equation proposed by V. Novikov. Here we present another 
proof of this theorem, which explains the underlying mechanism in terms 
of the orbits of a certain abelian group action on the set of all fc-edge 
matchings of the complete bipartite graph K n , n - 

1 Introduction 

The "Canada Day Theorem" refers to the following curious combinatorial fact: 
Theorem 1.1. Let T be the n x n matrix with entries 



and let X be an arbitrary symmetric n x n matrix. Then, for 1 < k < n, the 
sum of the k x k principal minors of TX equals the sum of all k x k minors 
of X (principal and non-principal). 

This theorem arose as an unexpected byproduct in our previous paper [8] 
where we studied so-called "peakon" (peaked soliton) solutions to a completely 
integrable nonlinear partial differential equation discovered by V. Novikov [12]; 
a brief account of this can be found in the appendix. The name of the theorem 
refers to its "birthday" (July 1, Canada's national holiday, in 2008). 

A k x k minor in an n x n matrix X — (xij) is determined by a choice of 
/c-element index sets / and J (row and column indices, respectively) from the 
set [n] = {1,2,..., n}. We will use the notation ('■?■') for the set of all fc-element 
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subsets of [n], and write minors as 
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\ X ikh 



for I, J S (^) , where ii < «2 < ■ ■ ■ < ik an d ji < fa < ■ ■ ■ < jk ar e the elements 
of / and J listed in increasing order. A principal minor is a minor with 1 = 3. 

The sum of the k x k principal minors of a matrix A is of course very familiar 
as the coefficient of (—l) k z n ~ k in the characteristic polynomial det(zl — A). The 
sum of all k x k minors is much less frequently encountered; one rare example 
in the literature is [13], but the results there do not seem directly related to 
Theorem 1.1, since they deal with arbitrary matrices where symmetry plays no 
role. 

Example 1.2. In the case n = 3 we have 



T = 




and hence 



TX 



a 

2a + b 



X = 



b 

2b + d 




c 

2c + e 



2a + 2b + c 2b + 2d + e 2c+2e + f, 



The sum of the principal lxl minors of TX is just the trace, (a + 2b + 2c) + 
(d + 2e) + /, which indeed equals the sum of all entries of X (i.e., the sum of all 
lxl minors) . The sum of the principal 2x2 minors of TX is 



a b 
2a + b 2b + d 



a 

2a + 2b 



2b + d 2c + e 

2b + 2d+e 2c + 2e + / 



c 2c + 2e + / 

which, as can be easily verified, equals the sum of all 2 x 2 minors of X 
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And, since det T = 1, the single 3x3 minor of TX (which is of course principal) 
equals the single 3x3 minor of X: det(TX) = det X. 

Example 1.3. For n > 3, the cases k — 1 and k — n of the theorem are still 
easy to verify: for k — 1 we have 

tr(TA) = \ tx{TX) + \tx{{TX) t )=tx{\{T + T t )X) = Y^ X H 



since |(T + T*) is the matrix with 1 in every position, and for k = n we have 
det(TX) = detX as before. However, the intermediate cases 2 < k < n — 1 
are more involved. (The reader might want to check the case n = 4, k = 2 to 
become convinced of this.) 
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Of particular importance for the Canada Day Theorem are minors whose 
row and column indices are interlacing, meaning that 



k < 3x < *2 < h < ■ ■■ < ik < 3k- 



(1.3) 



We abbreviate this condition as I < J. Given sets I, J £ ('■?), let I' = PI J) 
and J' = J \ (J fl J), and denote the cardinality of /' and J' by 



P = p(I, J) = card(J') = card (J') = k - card(I n J). 



(1.4) 



Note that / and J are interlacing if and only if I' and J' are strictly interlacing 
(abbreviated I' < J'), 



With this notation in place, we can state the more precise version of the Canada 
Day Theorem that we are actually going to prove; it says that the two sums are 
both equal to a third sum involving only minors with interlacing index sets: 

Theorem 1.4. Let T and X be as in Theorem 1.1, and let 



Then the following holds: 

(a) The sum of the principal k x k minors ofTX equals S. 

(b) The sum of all k x k minors of X equals S. 

The following is a short outline of the paper. In Section 2 we use the Cauchy- 
Binet formula and the Lindstrom-Gessel-Viennot path-counting lemma to prove 
the easier part of Theorem 1.4, namely part (a), which is true regardless of 
whether X is symmetric or not. Part (b) is proved in Section 3, and here it 
is crucial that X is a symmetric matrix. We introduce a group of "flips" and 
study an action of it on the set of all fc-edge matchings of the complete bipartite 
graph K n ^ n . The main technical point is dealt with in Lemma 3.8, and this 
leads to the characterization of the orbit structure of the group action presented 
in Lemma 3.9, Corollary 3.10 and Corollary 3.11. From these results it then 
follows that when expanding each k x k minor according to the definition of 
the determinant and adding everything up, terms corresponding to orbits of a 
certain type ( "no n- interlacing") cancel out, while the other orbits ("interlacing") 
give contributions adding up to the sum S. 

It is fair to say that the proof given in the present paper is not entirely 
different from the one in [8] . The central concept of (open) clusters introduced in 
Section 3 has its counterpart, namely linked pairs, in the original proof. However, 
the organization of the proof, in particular the identification of the group action 
on the set of all fc-edge matchings as the main underpinning for the Canada Day 
Theorem, is novel. 



A < fi < i'2 < 32 <•••<*« < 3p 



(1.5) 




(1.6) 



i<j 
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2 The sum of the principal k x k minors of TX 



In this section we prove part (a) of Theorem 1.4. Recall the Cauchy-Binet 
formula for the minors of a matrix product [6, Ch. I, § 2]: 



\(TX) AB \= Y, \Tai\\Xjb\, fe^Beft 

^( [1 r ] ) 



(2.1) 



Applying this with A = B = J we can rewrite the sum of the principal k x k 
minors of TX as 



Y \{TX)jj\= £ IT^IIA^I 

J6(W) /,*=('?) 



(2.2) 



Next, we need to compute the minors Here the notion of interlacing index 

sets (as defined in the Introduction) enters. 



Lemma 2.1. Let T be defined as in Theorem 1.1. Then, for I,j£ f^L™')) 



V( 7 - J ), ifI<J, 



0, 



otherwise, 



(2.3) 



w/iere p(J, J) = fc — card(J n J). 

Proof. The matrix T is the path matrix of a planar directed graph of the form 
illustrated below in the case n = 4 (i.e., the entry T a f, counts the number of 
paths - zero, one, or two - from source vertex number a on the left to sink vertex 
number b on the right): 



KD 4 



2 O 



1 O 




O 3 
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»0 1 



By the Lindstrom-Gessel-Viennot lemma [10, 11, 7, 1], the minor \Tjj\ equals 
the number of vertex-disjoint path families from sources indexed by J to sinks 
indexed by /. 

Consider a vertex-disjoint path family from the source set J = {1 < j\ < 
j 2 < ... < jk < n} to the sink set I = {1 < i\ < i<i < ... < < n}. The 
planarity of of the graph allows only paths j m — > i m . Clearly i m < j m , since no 
path can go upwards, and moreover j m < i m +i, since otherwise both the path 
jm — > im and the path j m +i — > hn+i would have to pass through the vertex 
immediately to the left of sink vertex i m +i- Hence, if the interlacing condition 
I < J is not satisfied, then there are no vertex-disjoint path families from J to /, 
and therefore \Tjj \ = in this case. 

Suppose now that I < J. For each m = 1, . . . , k we draw a rectangular 
window over the graph, with its upper left corner at source vertex j m and its 
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lower right corner at sink vertex i m . Imagine trying to construct a vertex-disjoint 
path family from J to /. There will be only one possible path j m — > i m if the 
mth window has its height j m — i m equal to zero, or if it shares its top edge with 
another window; otherwise there will be two possible paths j m — > i m . Indeed, 
when the height is zero there is only one way to start and finish on the same 
level. When the top edge is shared, there is again only one path, since the path 
corresponding to the window directly above must use the vertex immediately 
to the left of sink vertex i m +i (which is on the same level as source vertex j m ). 
Therefore, if we remove all windows which are of height zero or share their top 
edge with another window, then the number of possible vertex-disjoint path 
systems will be given by 2 to the power of the number of remaining windows, in 
other words 2 card ( 7 ') where I' = I \ (I n J). □ 

Now, inserting the value of \Tji\ from this lemma into equation (2.2), we 
obtain 

£ \(TX)jj\= J2 ^ piI ' J) \X IJ \ = S, (2.4) 

J6(W) I,Jg(W) 
I<J 

which finishes the proof of part (a) of Theorem 1.4. 

Remark 2.2. There are several ways to compute the minors \Tjj\; see [8] for 
another argument using induction on n. Note that the proof above is implicitly 
taking advantage of the factorization 



T = 
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3 The sum of all k x k minors of X 

In this section we prove part (b) of Theorem 1.4. 



3.1 Minors and matchings 

By the definition of the determinant, we have 

fx{ 1 j 1 . . . Xi ± j 



\X 



det 



(3.1) 



where Sk is the symmetric group of permutations on k elements (i.e., bijections 
7r: [fc] — > [k]). A less index-heavy notation is obtained by using that (for given / 
and J) each permutation n : [k] [k] corresponds to a unique bijection r : / — > J 
via r(ife) = jV(fc)- Setting sgn(r) = sgn(?r) and X (t) = ]J ieI x iT ^, we simply get 



(3.2) 



r: I->J 



5 



where the sum runs over all bijections from / to J. We visualize such a bijection r 
as a bipartite graph; for example, with n = 5, the bijection r : {2, 3, 4} — > {1,3, 5} 
given by r(2) = 3, r(3) = 5, r(4) = 1 is drawn as 




and it corresponds (for the given sets I and J) to the permutation 7r e 5 3 
represented by the graph 




The sign of r (and ir) is +1 or — 1 depending on whether the crossing number of 
the graph is even or odd. In this example there are two crossings, so sgn(r)X( T ) = 
+^23^35^41 (and this is one of the six terms in the 3x3 minor 1^234,135 1) ■ 

Note that when composing two bijections n : J — > L and t 2 : L — > J, the 
signs obey the same rule as for the corresponding permutations: 

sgn(r 2 o n) = sgn(T 2 ) sgn(n). 

Choosing fc-element sets / and J together with a bijection t: J — > J is 
equivalent to choosing a fc-edge matching of the complete bipartite graph K n , n : 



5 j ^^^^ D 5 5 • JO 5 

2< ^Z^^ 32 2 «r-^\. O 2 
1 % 1 1 • D 1 



(Recall that a matching of a graph X = (V, E) with vertex set F and edge set E 
is a subset F C E such that no two edges in F share a common vertex. One may 
equivalently think of the matching as being the subgraph (V, F). This habit of 
not distinguishing a graph from its edge set, when the underlying vertex set is 
understood, will be used quite frequently.) 

Fix n and k, and let M = M. n .k denote the set of fc-edge matchings of K n , n . 
We will use the same symbol r both for such a matching and for the corresponding 
bijection, and slightly abuse the language by speaking about matchings t: I J. 
If r(i) — j holds for the bijection t, then we say that the matching t contains an 
edge i — > j. (The graph K n ^ n is undirected, but the arrow notation is convenient 



G 



for distinguishing the left nodes labelled 1 , . . . , n from the right nodes also 
labelled 1, . . . ,n.) 

When summing all k x k minors of an n x n matrix X and expanding each 
minor, the whole sum turns into an alternating sum over all fc-edge matchings 
of K n>n : 

E \ X "\ = E E s ^(r)X (T) = J2 sgn(r)X (r) . (3.3) 

iM l t ] ) w( l l l ) T:I ^ J reM 

In order to prove that this equals 

J22 p{I ' J) \Xu\ = j: E 2^sgn(r)X (T) 

I<J I<J t: I->J 

when X is symmetric, as claimed in Theorem 1.4, we will introduce a certain 
abelian group Q which acts on M. in a way which preserves the weights Xr T \ but 
may change the signs sgn(r). Then we compute the sum X^tga-i ^ v adding the 
terms for each group orbit separately and then summing over all orbits. As we 
will sec, each orbit containing a matching r : / — > J with interlacing index sets 
I < J contributes 2 P ^ I ' J ^ terms which all have the same sign, while the other 
orbits contain equally many positive and negative terms and therefore cancel 
out. 



3.2 Clusters of a matching 

Fix a matching t : / — > J (viewed as a bipartite graph). Temporarily add auxiliary 
horizontal edges r — > r for all r € / n J. Split the resulting (multi)graph f into 
connected components, and then remove the auxiliary edges. The remnants of 
the components of f form a partition of the edges of r into what we will call 
clusters. 

Since r is a matching, no vertex in f can have degree greater than two, and 
therefore the components of f are paths. A cluster is either called closed or open, 
depending on whether the corresponding component of f is a closed or an open 
path. The endpoints of an open path will also be said to be the endpoints of 
that open cluster. 

Example 3.1. Let n = 8 and k = 7. Here is a matching r: I — > J, where I = 
{1,2, 3, 4, 5, 6, 8} and J — {1,2,4, 6, 7, 8}, together with its companion f obtained 
by adding auxiliary horizontal edges r — > r for r = ID J = {1,2,4, 5, 6, 8}: 
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Definition 3.2 (Endpoint separation). Let r: / — > J be a matching and C one 
of its open clusters. Consider the list consisting of all numbers in / U J, sorted 
in ascending order. (If i m = j m i for some m and m', then include that number 
twice in the list.) Let the endpoint separation sep(C) of the cluster C be the 
number of entries in this sorted list lying strictly between the numbers labelling 
the two endpoints of C. For closed clusters C, we set sep(C) = 0. 

Remark 3.3. Observe that the endpoint separation is a property of both the 
matching and the cluster, and not just of the cluster itself. 

Example 3.4. The open cluster C\ in Example 3.1, with endpoints labelled 3 
and 7, has endpoint separation sep(Ci) = 6, since there are six numbers (4, 4, 
5, 5, 6, 6) strictly between the 3 and 7 in the list (1, 1, 2, 2, 3, 4, 4, 5, 5, 6, 6, 7, 8, 8). 
For the closed clusters, we have sep(C2) = sep(Cs) = by definition. 

3.3 The group of flips 

Given n, let Q = Q n be the abelian group obtained by taking the direct sum of 
(2) copies of the two-element group Z/(2) (one copy for each pair (i,j) with 
1 < i < j < n). We define an action of Q on the set M. of fc-edge matchings 
of K n ^ n as follows: 

• Let fij — (0, . . . , 0, 1, 0, . . . , 0) € G, where the 1 is in the («, j)th copy of 
Z/(2). The elements {fij}i<j generate Q, so it is enough to define how 
they act. 

• If there is an open cluster C in the matching r € M containing one of the 
edges i — > j and j — > i (note that an open cluster cannot contain both), 
then let fij • t be the matching obtained by flipping the whole cluster C, 
i.e., replacing each edge a — > b in C by b — > a (and leaving all other edges 
in r as they are). 

• Otherwise, let fij do nothing: fij «r = r. 

It is straightforward to verify that this really defines a group action as claimed, 
since flips commute and flipping the same cluster twice is the identity trans- 
formation. Note that it is the flipping of a whole cluster (rather than just an 
individual edge) that ensures that fij • r is still a matching. 

Example 3.5. When /2s acts on the matching r of Example 3.1, the open 
cluster Ci (here drawn dotted) is flipped, since it is open (with endpoints 3 
and 7) and contains the edge 2 — > 8: 
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Lemma 3.6. If the matrix X is symmetric, then X( g . T \ — Xt T \ for all flips 
g € Q and all matchings r e M. 

Proof. This is immediate, since replacing the edge a — > b by the edge b — > a in 
the matching t : / — > J corresponds to replacing the matrix entry X a b by Xb a 
in the product X( T ) = rLe/^Mi)- D 

Example 3.7. The flip in Example 3.5 corresponds to changing 

-X(t) — ^16^28^34^42^55^61^87 

into 

-^(/28«r) = ^16^82^43^24^55^61^78- 

A less trivial aspect of the group action is the relation between sgn(<? • r) 
and sgn(r), and this is where the endpoint separation and the interlacing 
condition I < J will be of importance. 



Lemma 3.8. // the cluster C is flipped when flj acts on t, then 
sgn(/ 4J .r) = (-l) so P( c ' sgn(r). 



(3.4) 



Proof. As remarked in Section 3.1, the composition of matchings n : / — > L and 
r 2 : L — s- J (thought of as functions) satisfies sgn(r 2 on) = sgn(r 2 ) sgn(n). We 
will split r : / — > J into such a composition where n deals with the edges in the 
cluster and r 2 with the remaining edges. 

Consider an open cluster C in r with endpoints a G I' = I \ (I D J) and b £ 
J' = J\ (ID J): 



r(a) = ci, t(ci) = c 2 , t(c 2 ) 



C3, 



"(Cm-l) 



"(c m ) = 6. 



We let A = {ci, c 2 , . . . , c m } c In J and define T\ : I —> I+b—a and r 2 : 7+6— a 
J by 



n(x) 



r(x), if a; € A + a, 

if x e I \ (A + a), 



T 2 (X) 



x, if x € A' + &, 
t(z), if x e J\ (K + b). 

(3.5) 
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(For readability, we have written / + b — a instead of (/ U {&}) \ {a} and K + a 
instead of K U {a}.) These definitions imply that r = t^o t\. Schematically, 
with dashed arrows indicating the mapping x i-> x and solid arrows indicating 
x H> t(x): 



In J 



r 




in J 



Tl 



The result of flipping the cluster C is 
differs from ti in having the edge a 



T2 



b<? 



in J 



= T3 o t x 1 , where t% : I J ■ 
a instead of the edge b —> b: 

i 



I'-ai 





J' -b 



in J 



73 



We have sgn(r 3 ) = (—l) se P( c ) sgn(r2), since if we imagine detaching the edge 
b — > b from its vertices and continuously sliding it to the position a — > a, the 
crossing number of the matching changes by one each time either the left or the 
right end of of that edge moves past a matched vertex, and by the definition 
of endpoint separation there are exactly sep(C) matched vertices on the levels 
strictly between a and b. Together with sgn("r-f ^ = sgn(ri), this proves (3.4). □ 

Let us call a matching r : / — > J interlacing if the sets / and J are interlacing, 
I < J (cf. (1-3)). The orbits of the interlacing matchings (under the action of the 
flip group Q) will be called interlacing orbits, and all other orbits non-interlacing. 

Lemma 3.9. A matching a belongs to an interlacing orbit if and only if every 
cluster C in a has even endpoint separation sep(C). 

Proof. It is clear from the definitions that flipping a cluster does not change 
its endpoint separation. Closed clusters always have even endpoint separation 



11 



by definition (namely, zero). If C is an open cluster in an interlacing match- 
ing r: I ->• J, with endpoints i s £ I' = I \ (I n J) and j t £ J' = J \ (I (1 J), 
then in the case i s < j* we find from the interlacing condition that 

■ ■ • < j s -i < i s < ] s < ■■ ■ <i t < jt < U+i < ■ ■ , 

- v ' 

2{t — s) elements 

so that sep(C) = 2(t — s) is even, and the case i s > jt is similar. It follows that 
if a belongs to the orbit of an interlacing matching, then every cluster in a has 
even endpoint separation sep(C). 

Conversely, given a matching a : A — > B all of whose clusters have even 
endpoint separation, we form interlacing sets / < J by sorting the list of 
numbers (01, . . . , cifc, 61, ... , bk) in ascending order and labelling the elements of 
the sorted list as {i\ , j\ , 12 , ji , ■ ■ • , ik , jk) (note that I (1 J = An B). If a £ A 
and b G B are the endpoints of an open cluster C in <r, then (a,b) £ I x J or 
J x I (rather than I x I or J x J) because of the assumption that sep(C) is even. 
Flipping those open clusters in a whose endpoints belong to J x I produces an 
interlacing matching r : / — > J whose orbit a belongs to. □ 

Corollary 3.10. All matchings in a given interlacing orbit have the same sign. 

Proof. Combine Lemmas 3.8 and 3.9. □ 

Corollary 3.11. In a non-interlacing orbit, there are equally many matchings 
of each sign. 

Proof. By Lemma 3.9, each matching in a non-interlacing orbit has at least 
one open cluster with odd endpoint separation; among those clusters we single 
out the unique one with the property that its lowest-numbered node is smaller 
than that of the other ones. The operation of flipping that cluster is a sign- 
reversing involution pairing up the matchings in the orbit. (It's sign-reversing 
by Lemma 3.8, and it's an involution since the lowest-numbered node will still 
be the lowest-numbered node after the flip.) □ 

Lemma 3.12. (a) The orbit Q>t of a matching r: I — > J contains 2 p ( I,J ^ 
elements, (b) Each interlacing orbit contains exacly one interlacing matching. 

Proof. Recall that p{I, J) = card(J') = card (J') where V = I \ (I n J) and 
J' = J \ (I fl J). Each element of I' is the endpoint of an open cluster whose 
other endpoint belongs to J' (by the definition of an open cluster), while all 
other clusters are closed. Thus there are exactly as many open clusters in r as 
there are elements in /' and J', and each of these p(I, J) open clusters can be 
flipped independently of the others. This proves part (a). To prove part (b) we 
consider two cases. If an interlacing matching r has only closed clusters then 
by (a) card(C* »r) = 1, hence (b) holds. If, on the other hand, an interlacing 
matching r has at least one open cluster then flipping any open cluster destroys 
the interlacing property. This follows from the fact that I U J and I <~) J are 
invariant under flipping, and that there is only one way of constructing interlacing 
sets with given union and intersection. □ 

Now we only have to put the pieces together: 
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Figure 1: A three-peakon solution of the Camassa-Holm equation. 



Proof of part (b) of Theorem 1.4- Let M* C M. denote the union of the inter- 
lacing orbits. Then, referring in brackets to the relevant Lemmas and Corollaries 
above, we have 

\Xjj\ = sgn(r)X( T ) (def. of determinant) 

/,./ I, J t: I-*J 

= E S M T ) X (r) 

= E sgn(r)X (T) + Y s M T ) x (r) 

tEM* t^M* 

= E E E s ^(a)X {a) (def. of X*, 3.12b) 

I<Jt: I^,J oEG'T 

+ (3.6,3.11) 

= Y E 2P(/,J) s § n ( T ) x (r) (3.6,3.10,3.12a) 

I<Jt: I^>J 

-E 2 ^ J) f E sgn(r)X (T) ) 
I<J \t: I^.J ) 

= Y 2 p(/ ' J) \Xij\ (def. of determinant). 



KJ 



□ 
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A Appendix 

The Canada Day Theorem first arose in the context of nonlinear partial differ- 
ential equations in [8]. In this appendix we describe briefly the subject of that 
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paper and how it resulted in the formulation of the theorem. The nonlinear 
partial differential equation 



ut ~ u xxt + 4u u x = 3uu x ii xx + u u xxx , (A.l) 

where u = u(x,t), u x = ^(x,t), u t = ^(x,t), etc., was derived by V. Novikov 
[12] as part of a classification of generalized Camassa-Holm-type equations 
possessing infinite hierarchies of higher symmetries. The Camassa-Holm equation 

Ut u xx t + 3uu x — 2u x ii xx + uu xxx , (A. 2) 

which was formulated by Camassa and Holm in [5] as an integrable shallow 
water equation (with u denoting a horizontal velocity component), admits a 
particularly interesting class of explicit weak solutions which capture phenomena 
such as the collision and breakdown of waves. Miraculously, many of these 
features turn out to be intrinsically connected to classical problems in analysis, 
like Stieltjes continued fractions [3], or the moment problem [4]. These weak 
explicit solutions are called peakons (short for peaked solitons) because of the 
characteristic e - '^' shape of the waves. Multipeakon solutions are formed by the 
superposition of n peakons, 



«(i,t) = ym j (t)e- |I - Iilt)l , (A.3) 



-\X-Xi(t)\ 



with a suitable time dependence in the amplitudes mi(t) and positions Xi(t). 
Observe that since these solutions are not differentiable everywhere they only 
satisfy the PDE (A. 2) in a certain weak sense. The evolution of a three-peakon 
solution over time is illustrated in Figure 1. One of the objectives of [8] was 
to find explicit formulas for the positions and amplitudes of the peaks for the 
multipeakon solution of Novikov's equation (A.l). After substituting the ansatz 
(A.3) into (A.l), taking into account the weak nature of the solutions, one 
obtains 2n ordinary differential equations which govern the time dependence of 
the positions and amplitudes: 




(A.4) 



rrij sgn(xfe - Xj) e <[Xk Xj<[ 



(By definition, sgn(O) = here.) These equations were already stated in [9] where 
it was shown that they constitute a Hamiltonian system, and one of the main 
results of [8] was that they are also Arnol'd-Liouville integrable [2], meaning 
that there exist n functionally independent and Poisson commuting constants 
of motion Hi , . . . , H n . To display these constants of motion we first define the 
matrices 

P = diag(mi,...,m n ), 

E = (Eij)l j=1 , where E v) = eH^'l, (A.5) 
T = (Ty)?j=i, where Ty = 1 + sgn(i - j), 
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where P and E depend on the variables appearing in (A. 4). One can then 
show that for every complex A, the polynomial A(X) = dct(J — XTPEP) is a 
constant of motion for equations (A. 4), which implies that the coefficient of 
X k in the polynomial A(X) is a constant of motion. It can also be shown that 
these coefficients (for 1 < k < n) are in fact Poisson commuting and functionally 
independent, thereby providing the desired set of n constants of motion. By 
elementary linear algebra, the coefficient of X k can be computed (up to a sign) 
as the sum over all k x k principal minors of the matrix TPEP. However, before 
this result was found, direct computations for small values of n had indicated 
that the constants of motion ought to be given by the sums over all k x k minors 
of the symmetric matrix PEP. It was the attempt to reconcile these observations 
that led to the formulation of Theorem 1.1, and as a result, the constants of 
motion now have the following description: 

Theorem A.l. The Novikov peakon ODEs (A. 4) admit n constants of motion 
Hi, . . . , H n , where equals the sum of all k x k minors (principal and non- 
principal) of the n x n symmetric matrix PEP — (miTnje~ ' Xi— x ^)"j =1 . 

As a final remark, let us mention that explicit expressions for the minors of 
PEP can be written down easily with the help of the Lindstrom-Gessel-Viennot 
lemma; see [8] for details. 
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